A Bootstrap Approach to Automatically Generating Lexical Transfer Rules
نویسندگان
چکیده
We describe a method for automatically generating Lexical Transfer Rules (LTRs) from word equivalences using transfer rule templates. Templates are skeletal LTRs, unspecified for words. New LTRs are created by instantiating a template with words, provided that the words belong to the appropriate lexical categories required by the template. We define two methods for creating an inventory of templates and using them to generate new LTRs. A simpler method consists of extracting a finite set of templates from a sample of hand coded LTRs and directly using them in the generation process. A further method consists of abstracting over the initial finite set of templates to define higher level templates, where bilingual equivalences are defined in terms of correspondences involving phrasal categories. Phrasal templates are then mapped onto sets of lexical templates with the aid of grammars. In this way an infinite set of lexical templates is recursively defined. New LTRs are created by parsing input words, matching a template at the phrasal level and using the corresponding lexical categories to instantiate the lexical template. The definition of an infinite set of templates enables the automatic creation of LTRs for multi-word, non-compositional word equivalences of any cardinality.
منابع مشابه
Implementing Comprehensive Derivational Features in Lexical Tools Using a Systematical Approach
A systematic approach for automatically generating derivational variants based on the SPECIALIST Lexicon was proposed and implemented in Lexical Tools [1]. This approach addressed the prefix (PD), zero (ZD), and suffix (SD) derivations from nominalizations (nomD). This paper describes the generation of SD (not from nomD) based on the Lexicon in the Lexical Tools, including both SD-Facts and SD-...
متن کاملGenerating Training Data for Semantic Role Labeling based on Label Transfer from Linked Lexical Resources
We present a new approach for generating role-labeled training data using Linked Lexical Resources, i.e., integrated lexical resources that combine several resources (e.g., WordNet, FrameNet, Wiktionary) by linking them on the sense or on the role level. Unlike resource-based supervision in relation extraction, we focus on complex linguistic annotations, more specifically FrameNet senses and ro...
متن کاملAutomating the Generation of a Wide-coverage LFG for French using a MetaGrammar
In this paper, we explain how the notion of MetaGrammar, which has successfully been used for generating wide-coverage tree adjoining grammars (TAGs) for various languages such as French (Abeillé et al. (1999)) and German (Gerdes (2002)), may be used to generate a wide-coverage Lexical Functional Grammar (LFG) for French. We first introduce the notion of MetaGrammar and present the tools we use...
متن کاملAutomating Post-Editing to Improve MT Systems
Beyond manual and automated postediting, we describe an approach that takes post-editing information to automatically improve the underlying rules and lexical entries of a transfer-based Machine Translation (MT) system. This process can be divided into two main steps. In the first step, an online post-editing tool allows for easy error diagnosis and implicit error categorization. In the second ...
متن کاملText simplification using synchronous dependency grammars: Generalising automatically harvested rules
We present an approach to text simplification based on synchronous dependency grammars. Our main contributions in this work are (a) a study of how automatically derived lexical simplification rules can be generalised to enable their application in new contexts without introducing errors, and (b) an evaluation of our hybrid system that combines a large set of automatically acquired rules with a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره cs.CL/9907017 شماره
صفحات -
تاریخ انتشار 1999